Picture for Wen Huang

Wen Huang

D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models

Add code
May 14, 2026
Viaarxiv icon

Missing Old Logits in Asynchronous Agentic RL: Semantic Mismatch and Repair Methods for Off-Policy Correction

Add code
May 12, 2026
Viaarxiv icon

Thousand-GPU Large-Scale Training and Optimization Recipe for AI-Native Cloud Embodied Intelligence Infrastructure

Add code
Mar 11, 2026
Viaarxiv icon

3ViewSense: Spatial and Mental Perspective Reasoning from Orthographic Views in Vision-Language Models

Add code
Mar 08, 2026
Viaarxiv icon

ATA: Bridging Implicit Reasoning with Attention-Guided and Action-Guided Inference for Vision-Language Action Models

Add code
Mar 02, 2026
Viaarxiv icon

RL-VLA$^3$: Reinforcement Learning VLA Accelerating via Full Asynchronism

Add code
Feb 05, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

A Data-Centric Approach to Generalizable Speech Deepfake Detection

Add code
Dec 24, 2025
Figure 1 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 2 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 3 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Figure 4 for A Data-Centric Approach to Generalizable Speech Deepfake Detection
Viaarxiv icon

Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation

Add code
Nov 10, 2025
Viaarxiv icon

From Sharpness to Better Generalization for Speech Deepfake Detection

Add code
Jun 13, 2025
Viaarxiv icon